Estimation of phylogeny and invariant sites under the general Markov model of nucleotide sequence evolution.

Authors

  • Vivek Jayaswal
  • John Robinson
  • Lars Jermiin
Abstract

The models of nucleotide substitution used by most maximum likelihood-based methods assume that the evolutionary process is stationary, reversible, and homogeneous. We present an extension of the Barry and Hartigan model, which can be used to estimate parameters by maximum likelihood (ML) when the data contain invariant sites and there are violations of the assumptions of stationarity, reversibility, and homogeneity. Unlike most ML methods for estimating invariant sites, we estimate the nucleotide composition of invariant sites separately from that of variable sites. We analyze a bacterial data set where problems due to lack of stationarity and homogeneity have been previously well noted and use the parametric bootstrap to show that the data are consistent with our general Markov model. We also show that estimates of invariant sites obtained using our method are fairly accurate when applied to data simulated under the general Markov model.
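As a rough illustration of estimating invariant-site composition separately from variable-site composition, the sketch below computes a single-site likelihood for just two sequences. It is not the authors' implementation: the two-sequence simplification, the function name `site_likelihood`, and all numeric values are assumptions made for illustration. The transition matrix `P` is deliberately not symmetric, so the process it describes need not be reversible or stationary.

```python
# Hypothetical two-sequence sketch of a site likelihood with invariant sites.
# All names and numbers below are illustrative assumptions, not values from the paper.
NUCS = "ACGT"


def site_likelihood(x, y, p_inv, pi_inv, pi_var, P):
    """Probability of observing nucleotide x in one sequence and y in the other.

    p_inv  : proportion of invariant sites
    pi_inv : nucleotide composition of invariant sites (sums to 1)
    pi_var : nucleotide composition of variable sites at the first sequence (sums to 1)
    P      : 4x4 transition matrix for variable sites (each row sums to 1);
             it need not be symmetric or satisfy any reversibility condition
    """
    i, j = NUCS.index(x), NUCS.index(y)
    # Invariant sites can only produce identical nucleotides in both sequences.
    invariant = p_inv * pi_inv[i] if i == j else 0.0
    # Variable sites follow the Markov process and may also happen to match.
    variable = (1.0 - p_inv) * pi_var[i] * P[i][j]
    return invariant + variable


# Illustrative parameter values (assumed).
p_inv = 0.3
pi_inv = [0.1, 0.4, 0.4, 0.1]
pi_var = [0.25, 0.25, 0.25, 0.25]
P = [
    [0.70, 0.10, 0.10, 0.10],
    [0.05, 0.80, 0.05, 0.10],
    [0.20, 0.10, 0.60, 0.10],
    [0.10, 0.20, 0.30, 0.40],
]

# The 16 pattern probabilities form a proper distribution.
total = sum(site_likelihood(a, b, p_inv, pi_inv, pi_var, P) for a in NUCS for b in NUCS)
```

A matching pair such as A/A receives contributions from both the invariant and the variable class, whereas a mismatch can only come from the variable class. This is why the proportion and composition of invariant sites cannot be read directly off the constant columns of an alignment.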


Similar resources

Bayesian Analysis of Partitioned Data

Variation in the evolutionary process across the sites of nucleotide sequence alignments is well established, and is an increasingly pervasive feature of datasets composed of gene regions sampled from multiple loci and/or different genomes. Inference of phylogeny from these data demands that we adequately model the underlying process heterogeneity; failure to do so can lead to biased estimates...


Haplotype Block Partitioning and tagSNP Selection under the Perfect Phylogeny Model

Single Nucleotide Polymorphisms (SNPs) are the most common form of polymorphism in the human genome. Analyses of genetic variations have revealed that individual genomes share common SNP-haplotypes. The particular pattern of these common variations forms a block-like structure on the human genome. In this work, we develop a new method based on the Perfect Phylogeny Model to identify haplo...


Large-Scale Multiple Sequence Alignment and Phylogeny Estimation

With the advent of next generation sequencing technologies, alignment and phylogeny estimation of datasets with thousands of sequences is being attempted. To address these challenges, new algorithmic approaches have been developed that have been able to provide substantial improvements over standard methods. This paper focuses on new approaches for ultra-large tree estimation, including methods...


The Phylogeny of Calligonum and Pteropyrum (Polygonaceae) Based on Nuclear Ribosomal DNA ITS and Chloroplast trnL-F Sequences

This study represents phylogenetic analyses of two woody polygonaceous genera Calligonum and Pteropyrum using both chloroplast fragment (trnL-F) and the nuclear ribosomal internal transcribed spacer (nrDNA ITS) sequence data. All inferred phylogenies using parsimony and Bayesian methods showed that Calligonum and Pteropyrum are both monophyletic and closely related taxa. They have no affinity w...


Genetic Variation of Cytochrome b Gene in Three Local Goat Breeds in Central Java of Indonesia

This study was conducted to elucidate the genetic diversity and phylogenic relationship of three local goat breeds in Central Java of Indonesia based on the cytochrome b (cyt b) gene sequence. Blood samples of 19, 15 and 12 heads of Kacang, Kejobong and Ettawa Grade breeds, respectively, were used as materials of the study. The multiple alignment of cyt b gene was conducted by comparing under s...



Journal:
  • Systematic biology

Volume 56, Issue 2

Pages -

Publication date 2007